BLOCK-DIAGONALISATION OF MATRICES AND OPERATORS 
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Abstract. In this short note we deal with a constructive scheme to decompose a continu- 
ous family of matrices A(p) asymptotically as p ^ into blocks corresponding to groups of 
eigenvalues of the limit matrix ^(0). We also discuss the extension of the scheme to matrix 
families depending upon additional parameters and operators on Hilbert spaces. 



1. Matrix theory 

1.1. Preliminaries. We first recall some well-known facts about matrix equations of Sylvester 
type and their solution. Let G C™^™ be two matrices with 

Re spec > 0, Respect" < 0. (1) 

Then a solution to the Sylvester equation 

A+X - XA- = B (2) 

for a given right hand side B G C™^™ can be represented by the integral 

X= / e-*^^Be*^"di. (3) 



Indeed, by assumption ([T]) we know that there exists a constant c > such that the matrix ex- 
ponentials satisfy He^*"*^]! ||e*'^ || < e"'^* and the integral converges exponentially. Furthermore, 
plugging ^ into ^ immediately yields 



A+X - XA- = J (A+e-'-^+Se*^- _ e~'^* Be'^~ A-)dt = - A(e-*^^Be*^")dt = B. 

The special case, where A~ = — is known as Lyapunov equation and plays an essential 

role in control theory. For some details see, e.g., [Dat99| . 

We will apply this representation in a slightly modified form. Assume for this that there exist 
numbers a G = {a G C : a ^ 0} and /3 G M such that 

Re(aspecA+) > /3, Re(Q!specA~) < /3. (1') 

Then ^ can be solved by an integral of the form ^ replacing the path of integration by a 
suitable ray in the complex plane. 



aoo 



X^ / e-'^ Be'^'dt, (3') 



(or simply by dividing equation ([2|) by a) . Condition |PJ means that the spectra of and A 
are separated by the line {C G C : Re(Q!C) = /?}. 
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1.2. Matrix families and spectral block-decomposition. We proceed to our main topic 
and consider a family of matrices A{p) G C™^™ depending upon a (real or complex) parameter 
p. We assume it has a full asymptotic expansion as p ^ 0, 

A{p) - Ao + pAi + p^A2 + ■■■ , p-^0, (4) 

meaning that the difference of A{p) and the first N terms on the right is of order 0{p^). 

We assume further that the spectrum of the matrix Aq is decomposed into groups of eigen- 
values, 

spec = [J 6 j , conv &i n conv 6 j = 0, i ^ j, (5) 

such that any two components &i and &j have separated convex hulls. Then the following 
statement holds true. 

Theorem 1. Assume @ and Then there exists an invertible matrix family M{p) having a 
full asymptotic expansion as p —> such that the matrix 

M-\p)A{p)M{p) (6) 

is block- diagonal modulo H 0{p^) with blocks corresponding to the groups of eigenvalues given 
in ©. 

We will prove the following equivalent statement in a purely constructive way. The prove 
follows the standard scheme from [JW08., Section 2.1]. 

Theorem 2. Assume ^ and ([5]). Then there exists for any number N 

(1) matrices Mq, . . . , Mm-i G C*"^™, A/q invertible, and 

(2) matrices Aq, . . . , A^v-i; block- diagonal with blocks corresponding to the partition of eigen- 
values given in ([5]), 

such that 

CN-l \ /N-l \ /N-l \ 

E p'M. - E p'M. E p'^>^ - oip"")- (7) 

Proof. Without loss of generality we may assume that Aq is already of block-diagonal form, 
Aq ~ b-diag(5i, . . . , Sd) with d = \T\ and spec5j = &j. Then the corresponding statement with 
= 1 is valid with matrices Mq = I and Aq — Aq. 

Now assume that the statement is already proven for a certain number N — £. We are going 
to construct the matrices Mi and A^ in such a way that the statement with = ^ + 1 follows. 
Since we assumed that A(p) has a full asymptotic expansion the matrix family 

Biip) = Aip) (j2 p'^A ( E p'^'A ( E p'^^) = ^ip') (8) 

\fc=0 / \fc=0 / \fc=0 / 

has a full asymptotic expansion. We denote its leading coefficient as Bg = limp^o P~^Bi{p). Then 
we define Ae = b-diag-Bf, where b-diag selects the block-diagonal according to the partition (O, 
and define Mi to be a solution to the commutator equation 

[AQ,Me]+Bi-Ae = Q. (9) 

Before we are going to solve ([9|), we conclude the induction argument. If we consider the next 
approximation Be^i{p), we obtain 

Bi+i{p) = Be{p) + p'{[AQ,Mf] - A,) + 0{p'+') - 0{p'+^) (10) 
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by definition of B( and ©. In order to solve 
respect to the partition 



we write Mg and Bg as block matrices with 



Mi = 



such that (O reads as system of equations [Sj , 



Ad,d) 



[Br 



B 



(l,d)N 



Bf = 



\Bg 

and 



B 



{d,d) 



-B 



(11) 



(12) 



Equation (fT2ll can be solved by the method already described in Section lTTH while for the diagonal 
entries we may simply define m''/'^^ =0. □ 



On the proof of Theorem]^ To conclude the proof of the first theorem we have to justify two 
things: First, to any sequence of matrices (or resp.) there exists a (smooth) family of 
matrices M{p) (or A(p) resp.) with the prescribed asymptotic expansion. This is a well-known 
fact from asymptotic analysis and a direct consequence of Borel's theorem. Second, because 
the set of invertible matrices is open within C™'*'", the invertibility of M(0) = Mq implies 
invertibility of M{p) for sufficiently small p. □ 

1.3. Perturbation theory and further additional parameters. Now we assume that the 
matrix families depend continuously on a further parameter A{p,v), w G T, T being a compact 
metric space, such that the asymptotic expansions ^ are uniform with respect to the parameters 
(and have coefficients Aj{v), v € T, which are uniformly bounded with respect to the parameter 
v) and the separation condition (O is uniform in the parameter, i.e., there exists a constant 
S such that between any two (parameter-dependent) groups of eigenvalues &i{v) and &j{v) a 
(parameter-dependent) closed separating strip0 of width S can be placed. 

Corollary 3. Under these assumptions Theorems]^ and\^ hold uniform in the involved param- 
eters V €z T , especially all occuring matrices are uniformly hounded with respect to v. 

Proof. It suffices to provide a-priori bounds on the matrices Mg constructed within the proof of 
Theorem [21 If we assume the above given uniform separation of the components &j{v) of the 
spectrum of Aq, applied to yields 



\\M^'''\v)\\ < / ||e-*"('^)^*('')B(*-''Ht;)e*"(^)^^(^)||dt < C(f)||sf^')(t;) 



with constants C{v) estimated via 



Civ) 



||g-ta(i;)Si(i;)|| ||e*"(")'Sj(") ||dt < / Qt\\Siiv)\\+t\\S,(v)\\ ^ 2 



e-''dt 



(13) 



(14) 



to Jo JT 

for T choosen big enough. The choice of T is based on the spectral radius formula, which implies 
that 

||g±t(a(.)s./,W-;3W)||i/t < 2r,pec(e±("(")^-/^(")~'5(-))) = 2e~^^^ (15) 

for all t big enough. Because the left hand side is continuous in v and the estimate for fixed t 
implies the same estimate for 2t by submultiplicativity of the matrix norn|l , the constant T may 
be chosen locally uniform in v. By compactness of T the constants C{v) are uniformly bounded. 
Following the inductive scheme, we obtain bounds for all matrices uniform in u S T. □ 



^I.e. we assume that &i{v) and &j{v) are both disjoint to a strip {(^ G C : | Re(a(i))(J — | < <5/2} with 

suitable complex numbers a{v), f3{v) G C, |a(f)| = 1, depending continuously on the parameter 11 G T. 
2||e*-4||i/t < c implies \\e'^^A\\'/^ = He^-^e'-^Hi/t < \\eiA\^2/t < ^2 ^^us ||e2*'4||i/(2t) < 
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Remark. This is of particular interest, if Aq{v) has multiple eigenvalues for some v G T, be- 
cause then Theorem [T] gives a constructive approach to separate the group of these degenerate 
eigenvalues from the remaining spectrum in a uniform way. 

Remark. In particular cases the estimate (jl3p can be improved. If we assume that the matrices 
Si{v) and Sj{v) are both normal, the spectral radius of the semigroups coincides with their norm 
and hence the integral can be estimated directly to conclude C{v) — S^^. If both matrices are 
uniformly diagonalisable, the estimate has the form C{v) = C5^^, where C is the product of the 
condition numbers of diagonalisers to Si{v) and Sj{v). 

1.4. Multi-step schemes. In |JW08[ Section 2.2], |Jac08| multi-step diagonalisation schemes 
have been considered. They are based on a similar block-diagonalisation, but of minimal block 
size (i.e., blocks correspond to single eigenvalues of Aq) and — based on the assumption that 
corresponding components of Aq are diagonable — refined by applying the block-diagonalisation 
scheme to the terms of lower order inside these blocks. The scheme can be iterated and if a 
related hierarchy of conditions is satisfied, allows for perfect diagonalisation. 

We will not go into the particulars of this algorithm, but draw a consequence from its basic 
idea in combination with Theorem [TJ We do not need any particular assumptions besides the 
existence of the full asymptotic expansion (jj]) in order to conclude the following normal form of 
the matrix family A{p). 

Theorem 4. Assume @. Then there exists an invertible matrix family M{p) having a full 
asymptotic expansion as p such that the matrix 

M-\p)A{p)M{p) (16) 

is block- diagonal and each of its blocks is either of dimension 1 or of the form 

Pfc(p)I + /-J + 0(/+i), (17) 

where pk is a polynomial of degree at most k and J is a Jordan type matrix, i.e., where the only 
non-zero entries are 1 and located on the first upper side diagonal. 

Remark. Wc do not claim that J is a Jordan block, if different Jordan blocks to the same 
eigenvalue appear at a certain stage the scheme can not disentangle them any further. 

Proof. We apply Theorem [T] for the trivial partition of spec into its elements. This defines a 
first family of diagonalisers Mi{p) block-diagonalising A{p) up to infinite order. Consider now 
one of the resulting blocks, which is of the form 

S, + pK^^^ + p^A^^'^ + ■■■ ,p^0, (18) 

with leading term Sj satisfying | spec5j| — 1. We distinguish three cases: 
First case. The matrix Sj is of dimension one. 

Second case. If Sj is not diagonalisable. Then we can choose an invertible matrix M2 in such 
a way that it transforms Sj into its Jordan normal form Xjl + .Jj. 

Third case. If Sj is diagonalisable, it immediately follows that Sj — Xjl and we can apply 
Theorem [T] to p(A^"''' + pAj"''' -I- • ■ • ), which gives a second diagonaliser M2 \p) transforming the 
lower order terms. 

Collecting the matrices into M2(p) = dia.g{M^^\p) , . . . , M^'^\p)) , we obtain an invertible 
family resolving blocks one step further. In order to conclude the proof of Theorem 3] we iterate 
the procedure in the third case, which will either reduce the size of blocks and thus terminate 
after finitely many steps or lead at some point to a matrix f{p)l. There is no other possibility, 
because if in the third case a certain number of eigenvalues coincides up to infinite order the 
corresponding block Sj -\- pA'f^ -\- p^h!"-^^ can only have multiples of the identity as coefficients 
(since the first non- identic one could not be diagonalisable otherwise). □ 
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Even though the blocks have a relatively simple form, it is hard to draw any strong conse- 
quences on their spectrum from this representation. The spectrum of a small perturbation of 
large Jordan blocks and more generally Jordan type matrices can be quite far away from the 
spectrum of the Jordan matrix itself as |DH06| pointed out. 

2. Operator formulation 

We will finally give some remarks on similar approaches for operators on Banach or Hilbert 
spaces instead of finite-dimensional matrices. If all operators involved are bounded and we con- 
sider finitely many spectral components, the statements of Theorem[l]and[2]transfer immediately 
with the same proofs. To decompose into infinitely many components appears to be problematic 
in general because then the constructed block operator matrix Mi may become unbounded due 
to the non-uniformity of the decomposition ([5]) . 

To obtain a non-trivial situation we consider a pencil of unbounded operators on a Hilbert 
space H, 

A{p)^Ao + pAi, (19) 

Aq : H D T>{Ao) — > H closed and self-adjoint such that 'D{Ao) becomes a Hilbert space with 
respect to the graph inner product, and Ai : 'D{Aq) TL bounded. Then the pencil A{p) is 
closed for all p with domain V{Aq). 

For the following we assume that Aq is invertible and that its spectrum has a positive and a 
negative component. Thus we can write Aq as block-diagonal matrix Aq — Aia.g{A^ , A^) with 
a positive operator acting on the positive spectral subspace TL^ and a negative operator A^ 
acting on the negative spectral subspace Ti.^ . Furthermore, we write Ai as block matrix 

with components mapping A'l : 'D{A^) 7i+ etc. In the special situation that Aq is the 
Dirac operator (after applying the Fouldy-Wouthousen transformation to make it diagonal) and 
Ai a self-adjoint perturbation of a particular structure the Douglas-KroU-Hess scheme block- 
diagonalises the pencil A{p), see e.g. [JH89j . |SS06| or [LTOlj for a discussion of a related 
approach. 

Here we want to investigate how the proof of Theorem can be generalised to the above 
situation and used to construct a suitable block-diagonaliser. We will distinguish two different 
scenarios. 

2.1. Arbitrary perturbations and bounded diagonaliser. We will not make restrictions 
on Ai here, thus we treat non-selfadjoint perturbations. In order to apply the scheme, we need 
to solve a Sylvester type problem 

A+X-XA-=B (21) 
for an arbitrary operator B : D 'D{A^) 7i+. Again we want to represent X as integral, 

/>OC 

X = e-*^^Be*'^"di (22) 

but now we have to explain the meaning of it. The semigroups e*^ and e~*^^ are contractions 
(with norm estimated by e~^* for a certain constant 5 measuring the distance of from the 
spectrum of and A^). The interpretation of the integral (j22p depends on mapping properties 
of the operator B. 

It seems natural to introduce the sequence of Hilbert spaces H^^^ = P(|A(±)1T) for 7 > 
endowed with the inner product 

(x,y)„(±, =(|A(±)ra;,|A(±)ry) (23) 
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and describe mapping properties of B as boundedness between such spaces. We use the notation 
-^71,72 •= '^(''^71 '^72) Banach space of bounded operators. Note that gives an 

isometric isomorphism between 7i^^ and 

If S G /Bo.o, then the integral (|22p exists as Bochner integral and defines a bounded operator 
X G Sq.o- As we will see below, in the sense of an improper Riemann (-Bochner) integraQ, 



B = _ y " A (c-*^^i?e*^" ) At = A+X - XA- 



(24) 



such that X : 'D{A^) X>(A+) and (HH) follows directly. If S G Si,o, then the integral exists 
as Bochner integral and defines a bounded operator X G Bi^, i.e., an in general unbounded 
operator X : H~ D 'D{X) but further assumptions are needed to check (PT|) and to 

ensure the boundedness oi X. A possible way around this is given in the next statement: 

Lemma 5. Assume that B G S-y.o for some 7 G [0, 1). Then (|22p exists as Bochner integral and 
defines X G Bs^s for s G [0, 1). Furthermore, X : V{A^) V{A+) and satisfies (HI]). 



Proof. By the spectral theorem for self-adjoint operators we know that 

'S'^e-^^ t > 

for all 7 > and with S = inf spec \ A^ \ — inf spec \ A~^\ (w.l.o.g.). Therefore, for s, 7 < 1 

/•oo 

ii^iib.,. < / iiiA+re-*-4+iie(„+)iii3ib„oiii^-r"V^nb(«-)dt 



< ll^lls^.o { I't' --(-.^)d< + / e-2^*dt ) < 00. (26) 



\l3-,.o 

It remains to check (HH). Let x G P(^") = ■ Then 

Bx = lime-*^^i3e*^"x = - ^ (e-'^^ Be'^~ x) dt 



oc 



= A+ [J e-'"^ Be'^'dtj x-i^J e-*^ Se*^"dtj A'x, (27) 
where we used that for each t > the operator e^*'^* G B{Hf ,H^), = Clryo'H^. Hence, 

/oo 
e-*"^^ Be*'^'dt = Bx + XA-x eH+ (28) 

and by closedness of A~^ we conclude that Xx G T>{A'^) — TC^ and the left hand side is equal to 
A+Xx. Thus, dSIl) together with the mapping property X : 2?(A-) X»(A+) follows. □ 

Remark. It is straightforward to give explicit bounds on the norms H^Ubo ^^'^ II^IIb^ ^- They 
can both be estimated by 



max(l|X||5„,„,||X||B^,J|) < \\B\\b^, (e^(i+in7) ['"r^e-'^At + 5^ 



■y/S 



y-1 I le-y 



<\\B\\B^,„6''-'^-^ + ^j. (29) 

Especially for 7 = we get ||X||eoo — (2'^)^^ ll^llso normal matrices, cf. the remarks 

after Corollary [31 For 7 1 or (5 ^ the bound blows up. 



^We use the suggestive notation JZ^^ = linia/\^a /„/ ■ 
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Theorem 6. Assume that for a certain 7 £ [0, 1) we know Ai G B{Ti.-y,Ti.). Then there exists 
an invertible family M{p) of bounded operators such that 

M-\p)A{p)M{p) (30) 

is block-diagonal modulo f^O{p^) (in the operator-norm sense M^^{p)Ti,i "H). 

Sketch of proof. The proof works analogously to the one of Theorem [2 replacing all matrices 
by the corresponding operators. We have to make sure that the corner entries ' ' and i?^ ' 
always have the right boundedness properties in order to apply (|22p. In the first step this is 
exactly the assumption we made. For the following ones we proceed by induction. If Bi{p) G 
B{Hj,'H), it follows that e B{Hj,H) and that the constructed diagonahser is bounded 
Mi G B{7is,'Hs), s e [0,1) and maps T>{A) into itself. Looking at Bi^i{p) we see that the 
non- vanishing additional terms are of the form 

e. 

Be+i{p) - Be{p) = p'+^A^Me - ^ Af,+i_fcAfe) + 0{p'+^) G B{n^,'H) (31) 

k=l 

and the desired mapping properties follow. □ 

Remark. We know by construction that M(p) : TLi — > Tii, however, we don't know whether it is 
bounded as operator between these spaces. 

2.2. Selfadjoint perturbations and unitary diagonaliser. If we consider only self-adjoint 
perturbations it might be of interest to construct a unitary block-diagonaliser M{p). In order to 
apply a perturbation series method we consider the Cayley transform K(p) of M{p), 

A^(p)= K{p)^pK,+p'K2 + --- . (32) 

If M{p) ^ I in the norm sense as p — > 0, then K{p) will be a family of bounded operators (at 
least for small p), which can be achieved under the same assumptions on the perturbation as in 
the previous section. 

In order to avoid unnecessary repetitions we will only sketch the main difference to the previous 
considerations. Instead of a formulation in the spirit of Theorem [5] we multiply both sides with 
(I + iK{p)) and (I — iK{p)), respectively, and require that 

+ P'Kk) Aip) P'Kk) - f ^ - i E p'Kk) (j2 p'Ak] f / + i E P'K,) 

\ k=l / \ k=l / \ fc=l / \A:=0 / V k=l / 

(33) 

is of order 0{p^) for suitable self-adjoint and bounded operators Kk (having the same mapping 
properties as the Mk in the previous section) and symmetric operators . 

Denoting the p^-part of this expression as Bg, we conclude that the next terms are A^ = diag Bi 
and Kg subject to 

2i[Ki,Ao]^Be~Ai. (34) 

This is again an equation of form (|2ip for the corner entries of the block-matrix K( and the same 
procedure can be applied to its solution. Since A^ are self-adjoint, it follows that the upper right 
corner entry is the adjoint of the lower left and self-adjointness of Kg follows. 

3. Concluding remarks 

Diagonalisation schemes of the form of Theorem [2] separating single eigenvalues and their gen- 
eralisations have been useful for quite a few problems in the theory of hyperbolic and hyperbolic- 
parabolic coupled systems, as already pointed out and discussed in jJW08[ Section 3]. Utilisations 
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of these ideas can be found in |Tay75| , |Yag97| , [RYOOj . [RW05| . |RW08j . [WirOSj to name just a 
few references. They are stable enough to adapt them to diagonahsation schemes within symbol 
classes and to study the evolution of linear systems with variable coefficients. 

The author thinks that the generalisations to blocks demonstrated in this note will be of use 
for applications to more degenerate situations, especially due to the stability under perturbations 
following from the considerations in Section [1.3l 

The statements extent known facts from perturbation theory of matrices, cf. |Kat80] . where 
analytic dependence of matrices upon parameters were treated and representations in terms of 
Dunford were integrals given. 

The considerations in Section [2] allow weaker assumptions on the perturbation Ai compared to 
|LT01j or |KLT04| . where diagonally dominated operators require essentially 7 = 1/2. However, 
our approach gives only an asymptotic decoupling/diagonalisation of block operator matrices 
compared to exact formulas for block-diagonaliser in terms of a factorisation of the Schur com- 
plement associated to the block matrix in [LTOlJ . 
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